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Binding and Substrate Hydroxylation Step in AlkB Repair Enzymes 
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Abstract: AlkB repair enzymes are important nonheme iron 
enzymes that catalyse the demethylation of alkylated DNA 
bases in humans, which is a vital reaction in the body that 
heals externally damaged DNA bases. Its mechanism is cur- 
rently controversial and in order to resolve the catalytic 
mechanism of these enzymes, a quantum mechanics/molec- 
ular mechanics (QM/MM) study was performed on the de- 
methylation of the A/^-methyladenine fragment by AlkB 
repair enzymes. Firstly, the initial modelling identified the 
oxygen binding site of the enzyme. Secondly, the oxygen ac- 
tivation mechanism was investigated and a novel pathway 
was found, whereby the catalytically active iron(IV)-oxo 
intermediate in the catalytic cycle undergoes an initial iso- 



merisation assisted by an Arg residue in the substrate bind- 
ing pocket, which then brings the oxo group in close con- 
tact with the methyl group of the alkylated DNA base. This 
enables a subsequent rate-determining hydrogen-atom ab- 
straction on competitive a- and Jt-pathways on a quintet 
spin-state surface. These findings give evidence of different 
locations of the oxygen and substrate binding channels in 
the enzyme and the origin of the separation of the oxygen- 
bound intermediates in the catalytic cycle from substrate. 
Our studies are compared with small model complexes and 
the effect of protein and environment on the kinetics and 
mechanism is explained. 



Introduction 

Nonheme iron dioxygenases catalyse a range of important re- 
actions in Nature including the biosynthesis of antibiotics in 
microbes and the metabolism of, for instance, cysteine in 
mammals.^^'^^ In addition, nonheme iron dioxygenases have 
been linked to oxygen sensing and collagen cross-linking pro- 
cesses in the body, and as such they have vital functions for 
the biosystem,^^^ but unfortunately there are many unanswered 
questions related to their activity and the catalytic transforma- 
tion of substrates and detailed computational studies can shed 
light on this and predict a mechanism. The nonheme iron diox- 



[a] M. G. Quesne, Dr. R. Latifi, L E. Gonzalez-Ovalle, Dr. S. P. de Visser 
Manchester Institute of Biotechnology and School of 
Chemical Engineering and Analytical Science 

The University of Manchester, 131 Princess Street 

Manchester Ml 7DN (UK) 

E-mail : sam.devisser@manchester.ac.uk 

[b] Dr. R. Latifi 

Department of Chemistry, Tufts University 
MedfordMA, 02155 (USA) 

[c] Dr. D. Kumar 

Department of Applied Physics, School of Physical Sciences 
Babasaheb, Bhimrao Ambedkar University, Vidya Vihar 
Rae Bareilly Road, Lucknow 226-025 (India) 

Supporting information for this article is available on the WWW under 
http://dx.doi.org/10.1002/chem.201303282. 
^ © 2013 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. 
This is an open access article under the terms of the Creative Commons At- 
tribution License, which permits use, distribution and reproduction in any 
medium, provided the original work is properly cited. 



ygenases generally contain an iron active centre that is bound 
to the protein via a facial triad of amino acids that includes 
two histidine and one aspartate or glutamate residue in a 
2-His/1-Asp feature.^"^^ They utilise a co-substrate (a-ketogluta- 
rate, aKG) on an iron centre to convert molecular oxygen into 
a high-valent iron(IV)-oxo species, which has been proposed 
to be the active species of these enzymes.^^^ For several non- 
heme iron dioxygenases, this active species has been charac- 
terised by spectroscopic methods, including resonance Raman 
and Mossbauer spectroscopies, and it was found that the 
iron(IV)-oxo intermediate reacts by hydrogen-atom abstraction 
from the substrate with a large kinetic isotope effect.^^^ These 
experimental studies, however, mostly focused on the bacterial 
enzyme taurine/a-ketoglutarate dioxygenase. 

A special class of enzymes within the nonheme iron dioxy- 
genase family are the AlkB repair enzymes that repair methy- 
lated DNA (and RNA) bases that have been damaged by intra- 
or extracellular chemicals.^^^ This has a dramatic effect on 
normal cellular function as, for instance, A/^-methylation of an 
adenine residue of a DNA strain blocks DNA replication 
through the prevention of formation of Watson-Crick base- 
pairs.^^^ In addition, chemicals such as methylmethane sulfo- 
nate and methyl halides have been shown to generate 
A/^-methyladenine and A/^-methylcytosine. AlkB repair enzymes 
are the body's natural defence mechanism and react with 
damaged DNA and RNA chains by demethylation of these alky- 
lated bases in a reaction that uses one molecule of aKG and 
molecular oxygen.^^^ Biochemical studies were performed on 
the characterisation of the enzyme,^^°^ the protein structure,^"^ 
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and the substrate selectivity and binding of inhibitorsJ^^^ More- 
over, crystal structure coordinates were measured for the reac- 
tant as well as the substrate and hydroxylated product bound 
com p I exes J^^^ These studies highlight a tight bonding pocket 
for substrate and a catalytic mechanism that leads to hydroxyl- 
ation of the methylated DNA base. 

It has been hypothesised that the catalytic mechanism of 
AlkB repair enzymes is analogous to the nonheme iron dioxy- 
genases and proceeds by aliphatic hydroxylation of the methyl 
group followed by release of formaldehydeJ^^ Isotopic labelling 
and product distributions indeed confirmed that iron(ll), aKG 
and molecular oxygen are needed in the process and that 
formaldehyde and CO2 are formedJ^^'^^ Crystallographic data 
provided further evidence that AlkB repair enzymes belong to 
the aKG-dependent dioxygenases and highlighted a nonheme 
iron active site, where the metal is bound to a 2-His/1-Asp 
ligand system (Figure 1).^^"^^ However, a close inspection of the 
crystal structure coordinates reveals that the sixth binding site 
of the metal, that is, the O2 binding site, is located perpendicu- 
lar to the substrate orientation rather than in its vicinity. As 
such, an oxygen atom transfer within this conformation may 
be difficult and might proceed over a relatively large distance. 

Furthermore, previous computational studies of the Gauld 
group on the catalytic cycle of the AlkB repair enzyme using 
active-site models found a rate-determining hydrogen-atom 
abstraction barrier of 20.9 kcalmol"^ for methylated adenine.^^^^ 
This is a relatively high barrier for a nonheme iron(IV)-oxo 
complex, since, for a series of hy- 
drogen-atom abstraction reac- 
tions by analogous models 
much lower barriers were ob- 
tained for alternative substrates. 
As such, the value of 20.9 kcal Aspi: 
mol"^ in the gas phase would 
compare to that found for a sub- 
strate like propane,^^^^ which as 
far as we know is not hydroxylat- 
ed by nonheme iron enzymes. 
Consequently, a barrier with 
a magnitude over 20kcalmor^ 
may be a very slow process that 
is not efficient enough to 
happen in Nature. However, 
there may be effects of the pro- 
tein and the local environment 
that were not taken into consid- 
eration in the models of Gauld 
et al. that have affected the bar- ^ 
rier heights. The studies, there- Hjg^g^^^ 
fore, warrant a further set of cal- Aspi: 
culations and in particular one 
using quantum mechanics/mo- 
lecular mechanics (QM/MM) that 
takes the full scale of the protein 
and solvent into effect. 

Here we report this QM/MM 
study and focus on the catalytic 
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Figure 1. Extract of the active site of AlkB repair enzyme as taken from the 
3120 PDB file and the reaction catalysed by the enzyme. The proposed 
oxygen binding site trans to HiSi3i is filled with a water molecule. 



mechanism of oxygen activation by AlkB enzymes and the hy- 
droxylation of methylated DNA bases (Scheme 1). We investi- 
gate two possible oxygen binding positions on the iron(ll) re- 
actant complex with the superoxo either trans to H\s^^^ (struc- 
ture A) or irons to HiSigy (structure B). In the following step in 
the catalytic cycle the superoxo group is expected to attack 
the a-keto-position of aKG to give an iron(IV)-oxo species, CO2 
and succinate.^^^^ Technically two isomeric iron(IV)-oxo struc- 
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Scheme 1. Mechanism of substrate hydroxylation by an iron(IV)-oxo in AlkB repair enzymes. 
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tures are possible (R and R'), which may interconvert into each 
other. In this work we identify A as the oxygen-binding posi- 
tion and propose a novel mechanism whereby the iron(IV)- 
0X0 undergoes an isomerisation from R to R' prior to hydro- 
gen-atom abstraction. We also highlight the electronic changes 
during the reaction and the effect the isomerisation has on the 
electron-transfer pathways and the barrier heights for the reac- 
tion. 



Results and Discussion 

Oxygen binding site on the metal 

A recent QM/MM study investigated the hydrogen-atom ab- 
straction step of A/^-methyladenine (CH3Ade) by AlkB repair en- 
zymes.^^^^ They used a molecular geometry, whereby the oxo 
group was aligned with the substrate, that is, trans to HiSigy, 
and an almost linear Fe-0-HCH2-Ade angle, analogous to struc- 
ture R' in Scheme 1. However, the iron(IV)-oxo species is 
formed in a catalytic cycle from an iron(lll)-superoxo complex 
in a reaction with aKG (Scheme 1), and in the crystal structure 
displayed in Figure 1 there seems to be a dioxygen binding 
site irons to H'\s^^^. We decided, therefore, to first investigate 
the oxygen binding site of the enzyme, and thereby assign 
either R or R' as the reactant of the catalytic hydroxylation. 

We started the work with locating the oxygen binding site 
of AlkB, whereby we attempted to model dioxygen into the 
protein structure at various metal binding sites, that is, create 
structures A and B (Figure 2). Firstly, molecular oxygen was in- 
serted into the sixth binding site irons to HiSisi by replacing 
the water ligand. The binding pocket has sufficient space to 
accommodate molecular oxygen and no stereochemical clash- 
es are noted that would prevent it from binding in this posi- 
tion. The binding pocket is lined up with apolar and aromatic 
residues, such as the side chains of Ilei43, PheT54 and Trpiyg. Sol- 
vation of the protein still finds sufficient space in the binding 
pocket irons to H\s^^^ for two water molecules; this binding 
pocket, therefore, is large enough to accommodate molecular 
oxygen. Figure 2 gives the equilibrated MM structure of A. 

Thereafter, we inserted molecular oxygen into the metal po- 
sition irons to HiSigy with the distal oxygen atom on the line 
through the iron and the hydrogen atom from A/^-methylade- 
nine that is abstracted by AlkB in order to generate structure 
B. However, this structure has considerable stereochemical 
constraints and we failed to optimise its geometry and con- 
verge it to a local minimum. In particular, the proximal oxygen 
atom of the iron(lll)-superoxo with the oxygen irons to HiSigy 
gives close contacts (<1.7A) to the carboxylate group of 
AsPt33 and the methyl group of A/^-methyladenine. Further- 
more, the distal oxygen atom of the iron(lll)-superoxo has 
close contact with the carboxylate group of aKG (1.75 A), 
therefore, stereochemically it is not a stable structure. Conse- 
quently, these MM models show that molecular oxygen does 
not fit into the binding pocket in a position irons to HiSigy as it 
is too tight with too many closely packed residues. As struc- 
ture B is not a stable entity it cannot be a catalytic cycle inter- 
mediate and as a result molecular oxygen can only bind in the 
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Figure 2. Isomeric structures for dioxygen binding to tine iron(ll) centre of 
AlkB enzymes with either the O2 trans to HiSiji (top) or trans to HiSigy 
(bottom). The former structure is MIVl minimised, whereas the latter failed to 
converge due to stereochemical clashes. The 2-His/l-Asp structure is in 
purple, substrate in green and aKG in orange. 



position irons to H\s^^y that is, A. Our studies, therefore, have 
identified the molecular oxygen binding site as irons to H\s^^^ 
that in a reaction with aKG will give the iron(IV)-oxo species in 
the R conformation. This also implies that a subsequent iso- 
merisation will be needed to form the rotated iron(IV)-oxo 
species (R') with the 0x0 group irons to HiSigy, which is expect- 
ed to precede the hydrogen-atom abstraction reaction. 

The reported QM/MM results in ref. [18] did not consider 
this isomerisation and as a result may not reflect the correct 
mechanism of the chemical reaction as they may have used 
a wrong starting structure of the chemical process. To gain fur- 
ther insight into the mechanism of substrate hydroxylation by 
DNA base repair enzymes, we decided to carry out a QM/MM 
study. We started the work from the iron(IV)-oxo species as 
displayed in Figure 1 and we followed the mechanism as de- 
scribed in Scheme 1 until hydroxylated products. The work 
started from the iron(IV)-oxo species with the 0x0 group irons 
to HiSiBT (R), which is expected to isomerise to a position with 
the 0x0 group irons to HiSigy in R'. In the latter conformation 
the 0x0 group is in hydrogen bonding distance to the methyl 
group of A/^-methyladenine and takes up a hydrogen atom to 
form a radical intermediate (I) before hydroxyl rebound to 
form alcohol product complexes (P). 



Chem. Eur. J. 2014, 20, 435-446 



www.chemeurj.org 



437 © 2014 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim 




CHEMISTRY 

A European Journal 

Full Paper 



QM/MM set up and validation 

The work described in here uses the quantum mechanics/mo- 
lecular mechanics (QM/MM) procedure, whereby the inner core 
of the enzyme is described by density functional theory meth- 
ods and the rest of the protein and solvent with a molecular 
mechanics force field as explained in detail in the Experimental 
Section. Generally, the methodology follows previous QM/MM 
calculations of our group that were carefully benchmarked and 
calibratedJ^^^ The calculations start from the 3120 PDB file,^^"^^ 
which is a substrate bound iron(lll)-water complexed with aKG 
as described in Figure 1. The water ligand was replaced by an 
0X0 group and aKG by succinate to obtain our reactant struc- 
ture (R'): an iron(IV)-oxo group in a 2-His/1-Asp ligand environ- 
ment and succinate bound as a bidentate ligand. Subsequent- 
ly, hydrogen atoms were added and the protonation state of 
residues checked as described below. Thereafter, the structure 
was subjected to an iterative solvation procedure to give 
a chemical model with a total amount of 10011 atoms. 

After the set-up was completed we ran a molecular dynam- 
ics (MD) simulation for 800 ps at 298 K and 1 bar using the 
Charmm force field,^^°^ whereby the protein and solvent were 
allowed to relax. Figure 3 displays the total energy and the 
root mean square deviation (RMSD) of the protein structure of 
the MD trajectory of our reactant structure. As follows from 
Figure 3 after 20-30 ps of simulation the total energy reaches 
a plateau and the energy stabilises. At the same time the fluc- 
tuations in the protein structure and geometry stabilise and 
the RMSD values converges to a value of close to 1. These MD 
simulations match those found for the analogous nonheme 
iron enzymes proline hydroxylase that revealed conformational 
stability and rigidity after substrate binding.^^^^ 

We selected three snapshots from the MD simulations as 
starting points for the QM/MM calculations after 300, 400 and 
500 ps; designated Sn3oo, Sn4oo and Sh^qq, respectively. An over- 
lay of the three structures of these snapshots (Supporting In- 
formation, Figure SI) shows little differences in the protein ori- 
entation and the enzyme active site. The only differences ob- 
tained between these snapshots originate at distances far 
away from the reaction centre and generally are on the surface 
of the protein. We do not expect these motions and structural 
differences to be dramatic, but decided to carry out some test 
calculations for all three snapshots to determine the reproduci- 
bility and stability of the calculations. In all studies, we selected 
a large QM region containing the iron(IV)-oxo group, methyl- 
imidazole groups for H\s^^^ and HiSigy, acetate for succinate 
and AsPt33, methylguanidinium for Arg2io, and the A/^-methyl- 
adenine part of the substrate (Figure 4). 

Subsequently, we optimised the geometry of the iron(IV)- 
0X0 species using Sn^QQ, Sn^QQ and Sn^QQ in the lowest lying sin- 
glet, triplet, quintet and septet spin states. In all cases, the ge- 
ometry optimisation converged to a structure corresponding 
to R', in which the oxo-group bridges between the methyl 
group of methylated adenine and the iron atom. The quintet 
spin state is the ground state in each snapshot and the spin 
state ordering follows quintet-triplet-septet and singlet 
throughout, which implies that the quintet spin state will be 
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Figure 3. Molecular dynamics scan at 298 K of the protein, a) Plot of the 
total MlVi energy as a function of time, b) RMSD value of the protein struc- 
ture as a function of time. 
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Figure 4. QM region selected for the QM/MM calculations. Waved lines rep- 
resent QM/MM linkages that were described with the link atom method. 
Succ stands for succinate and SubH for substrate. 



the reactive state. Previous studies of nonheme iron reactivities 
showed higher barriers on the triplet spin state than on the 
quintet spin state,^^^^ therefore, it is not expected that the trip- 
let spin state will play a key role in this part of the catalytic 
cycle. Our calculations are in agreement with the experimental 
EPR characterisation of nonheme iron(IV)-oxo complexes in en- 
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zymes that were found to be in a high-spin stateJ^^^ Previous 
calculations on the iron(IV)-oxo species of DNA base repair en- 
zymes^^^'^^^ and analogous nonheme iron dioxygenases^^"^^ also 
identified it as a high-spin ground state. This contrasts biomim- 
etic nonheme iron(IV)-oxo complexes that generally are de- 
scribed with a triplet spin ground state.^^^^ Recent computa- 
tions, however, showed that pentacoordinated iron(IV)-oxo 
complexes stabilise the quintet spin state, whereas hexacoordi- 
nated iron(IV)-oxo usually has a triplet spin ground state.^^^^ 

All three snapshots give the same spin-state ordering and 
a well-separated quintet spin ground state from other states, 
therefore, we have focused in the following on the quintet 
spin mechanism of oxygen atom transfer only. Optimised geo- 
metries of ^R' as calculated in Sn3oo, Sn4oo and Sn^oQ are given in 
Figure 5. The iron(IV)-oxo bond is short, that is, 1.660-1.671 A, 




5R': Sngoo (Sn4oo) [Sn^oo] 

Figure 5. Optimised geometries of ^R' as obtained for Sh^qq, Sn^QQ and Sng, 
with bond lengths in Angstroms. 



which is indicative of a double bond and in agreement with 
previous calculations on related complexes.^^"^'^^^ The structures 
show little deviations between the three snapshots, and conse- 
quently the QM region is highly rigid and constraint. Further- 
more, the optimised geometries of the three snapshots impli- 
cates that the reproducibility of the results is high and that 
little movement in the central components of the protein and 
in particular the catalytic centre has occurred during the MD 
simulation. Further test calculations using different density 
functional methods were carried out, namely B3LYP, BBLYP^ 
BP86 and M06.^^^"^^^ In all cases a spin state ordering quintet < 
triplet < singlet was found with the quintet spin state as the 
ground state (see the Supporting Information). As the DFT 
methodology does not appear to give dramatic differences in 
spin-state ordering and relative energies, we decided to con- 
tinue with UB3LYP only. 

lron(IV)-oxo group isomerisation 

In the next set of calculations, we investigated as well as its 
barrier (^TS,) for the isomerisation into ^R'. A comparison of the 
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structure and electronic properties of ^R and ^R' reveals some 
interesting features that may affect the reactivity of these com- 
plexes. Let us start with a description of the electronic changes 
upon rotation from ^R to ^R'. Thus, the rotation of the oxo 
group from a position trans to H\s^^^ in ^R to a position trans 
to HisT87 in ^R' changes the shapes of the molecular orbitals. 
Figure 6 displays the valence orbitals of ^R and ^R', in which 
we used the nomenclature of Shaik, Solomon and co-workers 
for nonheme iron oxidants.^^^^ The labelling uses a molecular 
zaxis as taken along the Fe— 0 bond in ^R. In the lowest quin- 
tet spin state of ^R the metal 3d block of orbitals interact 
with ligands and split into a set of three Jt^ (jt\^, Jt^y^, Jt\y) or- 
bitals and a pair of two {o^^i, 0^2.^2) orbitals. The Jt^ orbitals 
represent the antibonding combinations of the 3d atomic orbi- 
tal on Fe with a Ip^/y orbital on oxygen, whereas the 0*^2 orbi- 
tal reflects the antibonding combination of 3d^2 on Fe with 
2p^ on 0. Finally, the 3d^2_y2 orbital on the metal forms anti- 
bonding combinations with ligands in the xy plane of symme- 
try. In pentacoordinated iron(IV)-oxo complexes the Jt\y and 
a\2_y2 orbitals are close in energy and hence a high-spin situa- 
tion is favourable.^^^^ Indeed, ^R, as found for analogous 
complexes,^^^'^^'^^'^"^^ has orbital occupation 

^o^/. By contrast, the nearest triplet spin 



yv ^ X7 V7 ^ y'' 



State is calculated to have 7i*xy^^*xz^ ^*yz configuration. 

Upon rotation of the 0x0 group, whereby ^R' is formed the 
molecular orbital interactions have altered. Thus, the Fe-0 
bond is now located along the molecular y axis and hence the 
labels of some of the orbitals have changed. In particular, the 
jt\y orbital in ^R' has a shape that matches the Jt\^ orbital in 
^R and the Jt\^ orbital in ^R' looks like the Jt\y orbital in ^R'. 
However, since both are singly occupied in ^R and ^R', that will 
not affect the relative energies of these isomers dramatically. 
The major differences between ^R and ^R' relate to the two 
orbitals, that is, the HOMO and LUMO orbitals, which, there- 
fore, also affect the reactivities. In ^R the 0^2.^2 is singly occu- 
pied and the o^^i orbital is virtual, whereas the ordering is re- 
versed for ^R' and the a^^2 orbital is singly occupied instead. In 
this orientation the a^^2 orbital in ^R' has lesser antibonding in- 



teractions than the HOMO, that is, 



in ^R and conse- 



quently ^R' is more stable than ^R. We calculated an isomerisa- 
tion energy difference of AE+ZPE = — 6.0 kcal mol"^ for Sh^qq 
at the UB3LYP level of theory. Changing the method to 
UB3LYP^ gives negligible changes to the energy difference be- 
tween ^R and ^R' and a value of —7.0 kcalmol"^ is obtained. 

Despite the fact that the orbital interactions have changed 
between ^R and ^R', there are actually very little changes in the 
group spin densities and charges of these complexes (see the 
Supporting Information). The spin density on the metal is 
slightly increased from 2.96 in ^R to 3.01 in ^R' and at the same 
time the oxygen atom loses spin density from 0.75 in ^R to 
0.69 in ^R'. Generally, the more radical character obtained at 
the oxygen atom, the more reactive a metal-oxo group is, and 
it may be anticipated that this small change of spin polarisa- 
tion from 0x0 to iron will make ^R' a slightly lesser oxidant 
than ^R. Our group spin densities and electronic state assign- 
ment matches those reported in previous DFT and QM/MM 
studies on analogous systems.^^^"^°^ 
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Figure 6. Orbital occupation of and ^R'. Also given are the natural orbitals for ^R and ^R'. 



In addition to differences in orbital interactions of the 
HOMO orbital in versus ^R' there are similar changes noted 
for the LUMO orbital. In ^R the LUMO orbital is the antibonding 
interaction along the Fe— 0 bond, which shows little involve- 
ment of other ligands. By contrast, the 0^2.^2 orbital displays 
interactions with four groups in the xy plane, namely the 0x0, 
AsPt33, HiSigy and succinate groups, and, therefore, it will be 
considerably higher in energy than the LUMO in ^R. Conse- 
quently, ^R' will have a much larger electron affinity than ^R 
and it will cost ^R more energy to abstract electrons from sub- 
strates. As hydrogen-atom abstraction is accompanied by 
a one-electron transfer from substrate to oxidant; this implies 
that ^R' will react with higher barriers than ^R. There must, 
therefore, be a fundamental reason for the enzyme to initiate 
the reaction with an isomerisation from ^R to ^R'. A possible 
reason is that the enzyme has to separate the dioxygen and 
substrate binding processes to avoid side reactions and by- 
products. Thus, after dioxygen binding to the iron centre an 
iron(lll)-superoxo complex is formed that reacts with aKG to 
form an iron(IV)-oxo and succinate.^^"^^ In several nonheme iron 
enzymes, however, the iron(lll)-superoxo complex is known to 
abstract hydrogen atoms from substrates directly. For instance, 
in the enzyme isopenicillin N synthase, the tripeptide 6-(L-a- 
amino adipoyl)-L-cysteinyl-D-valine, by four sequential hydro- 
gen-atom abstraction reactions, is converted into isopenicil- 
lin N through two ring-closure processes.^^^^ The first step of 
this reaction proceeds via an iron(lll)-superoxo intermediate. It 



may very well be that in AlkB repair enzymes the iron(lll)-su- 
peroxo has to be separated from substrate to prevent multiple 
hydroxylation reactions to occur on the methylated group, 
which would prevent the subsequent demethylation reaction 
into formaldehyde. 

To test the relative reactivity of an iron(lll)-superoxo versus 
an iron(IV)-oxo complex with A/^-methyladenine, we set up 
a DFT model complex based on the QM region displayed in 
Figure 4 and calculated the hydrogen-atom abstraction of 
both complexes. We found a hydrogen-atom abstraction barri- 
er of AE+ZPE = 32.8 kcalmol"^ for the iron(lll)-superoxo, 
whereas a value of AE+ZPE = 23.4 kcalmol"^ for the iron(IV)- 
0x0 was calculated. The latter matches the hydrogen-atom ab- 
straction barrier reported by Gauld et al. well although slightly 
different methods were used.^^^^ Thus, the energy difference 
between the two hydrogen-atom abstraction barriers implicate 
faster reactivity of the iron(IV)-oxo than the iron(lll)-superoxo 
complex. However, it is conceivable that the iron(lll)-superoxo 
reacts with aliphatic groups by abstraction of hydrogen atoms 
although it probably will be slow. 

Nevertheless, any reactivity of the iron(lll)-superoxo species 
would lead to the disruption of the catalytic cycle and the fail- 
ure to hydroxylate A/^-methyladenine to form A/^-hydroxym eth- 
yl adenine. It is, thus, very well possible that the enzyme has 
separated the oxygen binding and substrate binding positions 
in the active site of AlkB to prevent reactivity by the iron(lll)- 
superoxo species. The substrate is shielded from the iron(lll)- 
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superoxo group by an arginine residue (Arg2io) in the binding 
pocket. The superoxo group is too bulky to slide underneath 
Arg2io and attack the substrate, but once the iron(lll)-superoxo 
has reacted with aKG and formed an iron(IV)-oxo species an 
oxidant is formed that can migrate to a position trans to HiSigy 
and attack the substrate. This way the enzyme retains the re- 
gioselectivity of substrate hydroxylation and prevents the oc- 
currence of by-products. Most probably the enzyme pays 
a small thermodynamic price for this isomerisation process, 
which now happens with elevated hydrogen-atom abstraction 
barriers. 

Next we calculated the isomerisation barrier (^TS,) from to 
^R' in Sn5oo and the optimised geometry is given in Figure 7. 
Energetically, ^TS, is higher in energy than ^R by AE+ZPE = 
9.0 kcalmor\ QM/MM studies of the Borowski group^^"^^ on the 
nonheme iron halogenase SyrB2 were also predicted to start 




Figure 7. QM/MM optimised geometry of ^TS, using snapsiiot Spjc 
bond lengtiis in Angstroms. 



I witii 



with an isomerisation, whereby the positions of the oxo and 
halide ligands to the metal centre were interchanged. They cal- 
culated an isomerisation barrier of 13.4 kcalmol"^ for that pro- 
cess, which is not dramatically different from the value we 
found here. Our isomerisation is slightly lower in energy be- 
cause in AlkB only the oxo group migrates whereas in SyrB2 
both the 0X0 and halide groups interchange, which will raise 
the barrier heights. 

The optimised geometry of ^TS, shows some differences 
with respect to ^R and ^R'. Obviously the NHisT3T-Fe-0 angle 
changes along the rotation from 173.3° in ^R to 135.7° in ^TS, 
and to 97.2° in ^R'. In addition in ^TS, there is minor elongation 
of the two Fe-N distances with H'\s^^^ and HiSigy, but more dra- 
matically is the lengthening of the Fe— 0 bond to 1.721 A. This 
is because of a tight interaction of the oxo group with one of 
the protons of Arg2io, which is shortened from 2.414 A in ^R to 
1.888 A in ^TS| and then elongates again toward ^R' to a value 



of 2.470 A. The Arg2io residue, therefore, acts as a switch and 
assists with the rotation of the oxo group from trans to H\s^^^ 
to a position trans of HiSigy. 

Substrate hydroxylation by the iron(IV)-oxo complex 

We subsequently continued with calculations of the rest of the 
potential energy profile for the hydrogen-atom abstraction 
and followed by radical rebound to form alcohol product com- 
plexes (P). The hydrogen-atom abstraction passes a transition 
state (TSh) to form a radical intermediate (I) that is separated 
from products by a rebound transition state (TSreb). We calcu- 
lated the hydrogen-atom abstraction with QM/MM using snap- 
shot Sn3oo, Sn4oo and Sn^oo, whereas the rebound was only in- 
vestigated for SnsoQ. Despite the fact that we calculated the full 
potential energy profile of A/^-methyladenine hydroxylation on 
the singlet, triplet, quintet and septet spin state surfaces with 
QM/MM, actually only the quintet spin state is accessible for 
this part of the catalytic cycle and the other spin states are 
much higher in energy (see the Supporting Information). This 
is in agreement with previous studies of nonheme iron(IV)-oxo 
complexes in which single-state reactivity on a dominant quin- 
tet spin state surface was found.^^"^^^ Hence, we will focus on 
the quintet spin state results only and the other spin state 
structures and energetics are given in the Supporting Informa- 
tion for completeness. 

Figure 8 displays the complete potential energy profile from 
R to P as calculated with QM/MM in Snsoo- As mentioned 
above, the isomerisation barrier is about 9.0kcalmor^ and 
leads to the energetically more stable structure R'. Geometri- 
cally, the isomerisation step is assisted by Arg2io that hydrogen 
bonds with the oxo and succinate groups. Arg2io keeps the 
0X0 group in a specific orientation and guides its rotation 
toward the substrate. To highlight the motion of the Arg resi- 
due upon rotation from R to R' we have drawn a yellow box 
around its atoms. The isomerisation then brings the oxo group 
in close proximity to the substrate to enable a regioselective 
hydrogen-atom abstraction and prevent reactivity of its precur- 
sor the iron(lll)-superoxo species. The tight substrate binding 
pocket and the Arg-assisted isomerisation mechanism enable 
regioselective hydroxylation of the methylated DNA base with- 
out activation of any of the other C— H bonds in the substrate. 

We located two distinct pathways for hydrogen-atom ab- 
straction by ^R'. Hydrogen-atom abstraction from A/^-methyl- 
adenine by ^R' can either lead to electron transfer into the vir- 
tual a\2_y2 orbital, the so-called ^a-pathway, or to double occu- 
pation of the Ti^xz orbital through the so-called ^Jt-pathway.^^^^ 
Usually, in nonheme iron(IV)-oxo complexes the ^a-pathway is 
considerably lower in energy than the ^Jt-pathway, and, hence 
is the dominating quintet spin electron transfer mecha- 
nism.^^^'^"^'^^^ Generally, in the ^a-pathway the substrate attacks 
from the top and incurs little stereochemical interactions with 
the metal ligands, whereas in the ^Jt-pathway the substrate ap- 
proaches under an Fe-O-H angle of about 120° that is stereo- 
chemically disfavoured.^^^^ Due to the rotation of the iron(IV)- 
0X0 group and a change of orbital shapes (Figure 6), the ^o- 
pathway is not the dominating pathway anymore. We located 
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Figure 8. Potential energy landscape for the hydroxylation of A/^-methyladenine as calculated with QM/MM. Energies are obtained at UB3LYP/B2//UB3LYP/B1- 
Amber, whereas in parenthesis are given UB3LYPVB2//UB3LYP/B1 -Amber results. All energies contain zero-point corrections and are given in kcalmol \ 



transition states for both pathways, designated ^TSh,o and 
^TSHjt/ as well as the two isomeric radical intermediates, ^1^ and 

Based on the orbital diagram for and the unfavourable 
angle between the a^^2 and the substrate location, one might 
have expected high barriers for ^TSh,o but actually due to the 
change of the ordering of 0*^2 and 0^2.^2 upon rotation of the 
iron(IV)-oxo group from to ^R' the o-pathway is still a viable 
reaction pathway. Moreover, the ^Jt-pathway is stabilised and 
becomes competitive with the ^o-pathway. Nevertheless, the 
rate-determining step in the reaction mechanism in Figure 8 is 
the hydrogen-atom abstraction via barrier ^TSh, and both locat- 
ed transition states ^TSh^ and ^TSn^t originate from the same 
reactant structure ^R'. Thus, during the hydrogen-atom abstrac- 
tion an electron is transferred into the 0^2.^2 orbital in ^TSh ^ to 
form an exchange coupled radical intermediate (^IJ with 
jt\/ Jt^y/ jt\y^ a^^2^ a\2_y2^ (j)sub^ configuration. By contrast in 
^TSn^t a low-lying jt^ orbital is doubly occupied to give a 
radical intermediate (Hj with configuration 
jt\/ Jt^y/ jt\y^ a^^2^ a\2_y2° ())sub^. As the isomerisation has led to 
an inversion of the ordering of a^^2 and 0^2.^2 the a-pathway 
now involves electron transfer into the 0^2.^2 orbital and an 
alignment of the substrate along the molecular y axis. Indeed, 
an Fe-O-C angle of 141.4° is found for ^TSh,o/ whereas ^TS^^^, 
gives a much smaller angle of 131.4°. 

Because of the fact that the angle in ^TSh^ deviates signifi- 
cantly from the ideal angle of a linear Fe-O-C conformation 
due to constraints on the enzyme substrate and co-factor 
binding pocket, the a-pathway is destabilised. By contrast, the 
jt-pathway requires angles of typically 120° and indeed here 
we find an Fe-O-C angle of 131.4°. Because of the unfavoura- 
ble angle in the a-pathway, it is destabilised in energy and the 
Jt-pathway becomes competitive. This is fundamentally differ- 
ent from hydrogen-atom abstraction along the molecular 
zaxis as is normally the case for nonheme iron enzymes that 
appear to mostly react through the a-pathway. Note as well 



that the LUMO orbital in ^R, that is, the a''^2 orbital is lower in 
energy than the LUMO orbital in ^R', that is, the a\2_y2 which 
has considerably more antibonding interactions between the 
metal and its ligands, and, therefore, it requires more energy 
to fill it. 

Energetically in Snsoo, ^TSh,o and ^TS^^j, are of comparable en- 
ergies: 18.2 versus 18.6kcalmor^ at UB3LYP level of theory, 
and 15.0 versus 15.2 kcalmol"^ at UB3LYP^ level of theory with 
respect to ^R'. We also calculated ^R' and ^TSH,jt in Sn3oo and 
Sn4oo, but similar group spin densities and charges are found 
and a minor stabilisation of the barrier heights is observed: 
13.0kcalmor^ for Sn3oo and 15.1 kcalmol"^ for Sn4oo. As such 
there is a small effect of the protein on the barrier heights of 
the hydrogen-atom abstraction, although, geometrically they 
look very similar to the structure displayed in Figure 8 for Sh^qq 
(see the Supporting Information). Our values are also close, 
but slightly lower in energy than those reported by Cisneros 
and co-workers.^^^^ Furthermore, DFT studies using an active- 
site complex found a rate-determining hydrogen-atom abstrac- 
tion barrier of 20.9 kcalmol V^^^^ It appears, therefore, that the 
protein has a minor but stabilising effect on the transition 
state of the reaction. These high energetic barriers are caused 
through the isomerisation of the iron(IV)-oxo species, which 
raises the electron affinities and the corresponding hydrogen- 
atom abstraction barriers. 

Figure 9 gives QM/MM optimised geometries for hydrogen- 
atom abstraction transition states and intermediates on the 
competing a- and Jt-pathways. In both cases the hydrogen- 
atom abstraction is late on the potential energy surface with 
short 0-H distances of 1.141 and 1.160 A for ^TSH,a and ^TSh,^, 
whereas their corresponding H— C distances are considerably 
longer: 1.367 and 1.500 A, respectively. As previously shown,^^^^ 
late transition states generally correspond with high energetic 
barriers. Due to single occupation of the o*^2_y2 orbital in ^TSh,(j 
the Fe-0 and Fe-NHiSigy distances are somewhat longer than 
those in ^TSh,;,, in which this orbital is virtual. The Arg2io residue 
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Figure 9. Optimised geometries of ^TSh,^/ ^TSH,jt/ ^'o and ^1^, as obtained from 
QM/MM calculations with bond lengths in Angstroms. 



forms hydrogen bonding interactions with the carboxylate 
groups of succinate and AsPt33 as well as with the oxo group 
at a distance 1.877 (1.753) A for 'TSh,o ('TSh,J. 

After the hydrogen-atom abstraction the system relaxes to 
a radical intermediate (^1^^ or \), whereby the former has orbi- 
tal occupation ji^yj 7i*,J ji^J o*y2_,/ ^^^^^^ whereas it is 



t 



u x2-y2 M^Sub 

-y2 M^sub foi" the latter. The hydroxyl 
group in ^1^ and ^1^, is locked in hydrogen-bonding interactions 
with the carboxylate group of AsPt33 at a short distance of 
1.701 (1.781) A for % {%). As such it is not surprising that hy- 
droxyl rebound to the substrate radical gives a significant bar- 
rier: on the Jt-pathway we located a ^TS^eb barrier of 13.1 kcal 
mol"^ above ^1^,, although this is lower in energy than the hy- 
drogen-atom abstraction barrier of 18.6 kcal mol"^ for the Jt- 
pathway. This substantial rebound barrier was previously 
shown to lead to rearrangement patterns and the formation of 
by-products due to the relatively long lifetime of the radical in- 
termediate.^^^^ Despite the large rebound barriers, therefore, 
the hydrogen-atom abstraction step is still the rate-determin- 
ing step in the reaction mechanism. The large rebound barrier 
obtained in this work contrasts the result of DFT model calcu- 
lations,^^^^ in which a rebound barrier of just 0.6 kcalmol"^ was 
found. Clearly, the strong hydrogen-bonding interactions of 
the OH group to the carboxylate groups of succinate and 
AsPt33 and the methylguanidinium group of Arg2io stabilise the 



radical intermediates and raise the barrier for OH transfer to 
the radical. This highlights the importance of inclusion of part 
of the protein and particularly the hydrogen bonding network 
in the model. 

To find out whether the hydrogen abstraction barrier is de- 
pendent on the substrate, we also calculated it for A/^-methyl- 
guanine, A/^-methyladenine, A/^-methylcytosine and A/^-methyl- 
thymine using DFT model complexes and compared the re- 
sults with the QM/MM studies described above (Figure 10). 
Our calculated hydrogen-abstraction barrier for A/^-methylade- 



N^N'H 



0=< ,N 



/ 

.CH2 



NHp 



or 

AE* = 27.9 kcal mol"'' 
A/''-methylguanine 



H 

^2 NH2 
AE*= 6.5 kcal mor'' 

A/^-methylcytosine 



AE* = 23.4 kcal mor'' 
A/''-methyladenine 

CH3 



"2 o 
AE*= 10.9 kcal mor'' 

A/^-methylthymine 



Figure 10. DFT calculated hydrogen-atom abstraction barriers from methy- 
lated DNA bases as calculated with DFT model complexes. All energies are 
in kcalmol^ and contain ZPE and solvent corrections. 

nine of 23.4 kcal mol"^ is close in energy to the one reported 
by Gauld and co-workers.^^^^ We located hydrogen-atom ab- 
straction barrier heights from the A/-methyl positions of 
A/^-methylcytosine, A/^-methylguanine and A/^-methylthymine of 
AE+ZPE = 6.5, 27.9 and 10.9 kcal mol"\ respectively. To find 
out whether this trend correlates with the strength of the C— H 
bond that is broken for these substrates, we calculated the 
bond dissociation energy (BDE^h) for these bonds using proce- 
dures used before.^^^^ The BDEch value of substrates (SubH) was 
calculated from the energy difference of the isolated substrate 
with the sum of an isolated hydrogen atom and the substrate 
minus one hydrogen atom. As follows the BDEch values for 
these four substrates fall within a narrow window of about 
5kcalmor\ and should have resulted in very similar barrier 
heights for all substrates. Inspection of the optimised geome- 
tries (see the Supporting Information) shows that in the DFT 
model complexes the substrates have reoriented in several 
cases and formed multiple hydrogen-bonding interactions 
with the carboxylic acid groups of succinate and AsPt33 as well 
as the oxo-group. As shown before,^^^^ hydrogen-bonding inter- 
actions reduce catalytic efficiency and raise hydrogen-abstrac- 
tion barriers and therefore these model complexes are not 
a good representative of the catalysis in the actual enzyme. 
Thermodynamically, the AlkB repair enzyme should, therefore, 
also be able to hydroxylate, and, consequently, repair A/^-meth- 
ylcytosine and A/^-methylthymine. 
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Conclusion 

In this work we report a series of QM/MM and DFT studies on 
the catalytic mechanism of substrate activation by AlkB repair 
enzymes. We analysed PDB structures and attempted to insert 
a dioxygen molecule in several positions and found only one 
feasible binding position to iron trans to H\s^^^. The iron(lll)-su- 
peroxo species is separated from the substrate by a considera- 
ble distance and its approach is blocked by an Arg residue. 
This Arg2io residue acts as a latch and only allows the isomeri- 
sation of iron(IV)-oxo and prevents the iron(lll)-superoxo from 
reacting with substrate. It is proposed that the iron(lll)-super- 
0X0 reacts with aKG to form iron(IV)-oxo, succinate and CO2. 
The iron(IV)-oxo initially has the 0x0 trans to H\s^^^ and isomer- 
ises to a position trans to HiSigy. The isomeric iron(IV)-oxo spe- 
cies was found to react via hydrogen-atom abstraction on 
competing ^o- and ^Jt-pathways to form a radical intermediate 
followed by rebound to give alcohol products. The hydrogen- 
atom abstraction is the rate-determining step in the reaction 
mechanism. 

The studies presented in this work give important insight 
into the substrate and oxygen binding channels in the 
enzyme. We show that it is essential to separate the substrate 
and oxygen binding channels as otherwise the iron(lll)-super- 
0x0 will react with substrate and prevent the repair reaction of 
the DNA. We also show that the isomerisation reorganises the 
high-lying occupied and low-lying virtual orbitals and, thereby, 
affects the electron transfer abilities of the oxidant. The isomer- 
ic iron(IV)-oxo reacts with substantially larger barriers as 
a result, which is a price the enzyme pays for separating the 
substrate and dioxygen binding channels. 



Experimental Section 

For the set-up of the QM/MM system we used well-tested and 
benchmarked methods as reported before.^^^^ Starting from the 
3120 PDB file,^^"^^ hydrogen atoms were added to the structure 
using the PDB2PQR program package^'^^^ and the active site was 
manually modified from the iron(ll) — water aKG complex into an 
iron(IV)-oxo succinate (Succ) active site. Apart from the two histi- 
dine groups that are bound to the metal, all other histidine side 
chains were doubly protonated. Furthermore, we made sure that 
all arginine and lysine side chains were protonated and all gluta- 
mic acid and aspartic acid side chains were deprotonated. This re- 
sulted in a structure with overall neutral charge. Solvent (with 
sphere of radius of 35 A) was added to this structure, and equili- 
brated, followed by a molecular dynamics minimisation and heat- 
ing procedure to 298 K of the full structure using the CHARMM 
force field.^^°^ The total model has 10011 atoms and includes 2269 
TIP3P water molecules. We selected several snapshots from this 
MD simulation as starting points for the QM/MM calculations at dif- 
ferent time intervals. The iron(IV)-oxo reactant was geometry opti- 
mised in all low-lying and accessible spin states, that is, singlet, 
triplet, quintet and septet, and the spin state ordering and relative 
energies of these snapshots at 300, 400 and 500 ps {Sh^qq, Sh^qq, 
Snsoo) gave consistent and reproducible results. Subsequently, we 
investigated the full potential energy profile with Snsoo as well as 
the hydrogen-atom abstraction step by Sh^qq and Sn4oo. We chose 
to start the work from the iron(lll)-superoxo complex and studied 



the mechanism until formation of alcohol product complexes as 
this procedure means the number of atoms in the model stays 
constant during the reaction. 

QM/MM calculations employed the ONIOM program package as 
implemented in Gaussian 09.^^^^ The QM region was described by 
density functional theory and the unrestricted B3LYP functional,^^^^ 
while we used the Amber force field for the MM region.^"^^^ The QM 
region contained the iron(IV)-oxo group, methylimidazole groups 
for HisT3T and HiSigy, acetate for succinate and AsPt33, methylguani- 
dinium for Arg2io, and the A/^-methyladenine part of the substrate. 
All structures described here are the result of a full QM/MM geom- 
etry optimisation of all degrees of freedom. Stationary points were 
characterised by running an analytical frequency calculation on the 
QM region only at the same level of theory. Geometry optimisation 
and frequency were carried out with a double-^ quality LACVP 
basis set on iron that contains a core potential and 6-31 G on the 
rest of the atoms (basis set Bl).^"^^^ Single-point energy calculations 
with a Wachters all-electron basis set on iron and 6-31 +G^ on the 
rest of the atoms (basis set B2) was carried out on the optimised 
geometries at QM/MM to improve the energetics.^^"^^ All energies 
reported in this work were obtained with basis set B2 and include 
ZPE corrections. Previous studies of our group on nonheme 
iron(IV)-oxo complexes showed these methods to be sufficiently 
accurate to match experimentally determined free energies of acti- 
vation.^^"^^ We used electronic embedding procedures whenever 
possible. To ascertain that the obtained results are reproducible we 
also calculated single-point energies using the UB3LYP^ density 
functional in which the amount of HF exchange was reduced to 
15%,^^^^ as well as single points using either the BP86 or M06 meth- 

To gain further insight into the reaction kinetics and energetics we 
supplemented our studies with DFT model calculations in which 
we took a model containing iron(IV)-oxo with two imidazole and 
two acetate groups as the active species of the enzyme in analogy 
to previous model calculations.^^^'^"^^^ We then treated this oxidant 
with methylated DNA bases: A/^-methyladenine, A/^-methylguanine, 
A/^-methylcytosine and A/^-methylthymine. The geometries were 
optimised with UB3LYP/B1 and characterised with a frequency cal- 
culation. Single-point calculations were carried out with UB3LYP/B2 
and solvent corrections were included. 

We also calculated a DFT model complex based on the above- 
mentioned QM region for an iron(lll)-superoxo complex and calcu- 
lated the hydrogen-atom abstraction from A/^-methyladenine at 
UB3LYP/B1. Geometries and frequencies were carried out in Gaussi- 
an using this level of theory and energies were corrected with 
single points at UB3LYP/B2 and solvent corrections. 
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